# Local deployment optimization

Bielik 4.5B V3.0 Instruct GGUF
Apache-2.0
Bielik-4.5B-v3.0-Instruct-GGUF is a Polish large language model released by SpeakLeash. It is a conversion of Bielik-4.5B-v3.0-Instruct to the quantized GGUF format, suitable for local inference.
Large Language Model Other
speakleash
693
4
Apriel 5B Instruct Llamafied
MIT
An approximate reimplementation of ServiceNow-AI's Apriel-5B-Instruct model, converted to Llama format so that it works with mainstream fine-tuning frameworks.
Large Language Model Transformers
mrfakename
63
3
Huihui Ai Gemma 3 1b It Abliterated GGUF
This is a quantized version of huihui-ai's abliterated variant of Google's Gemma 3 1B instruction-tuned model, produced with llama.cpp and suitable for running in resource-limited environments.
Large Language Model
bartowski
3,123
3
Deepseek R1 GGUF
MIT
A 1.58-bit dynamically quantized version of DeepSeek-R1 produced by Unsloth. The model uses a Mixture-of-Experts (MoE) architecture and supports English-language tasks.
Large Language Model English
unsloth
2.0M
1,045
E5 Base V2 Gguf
MIT
A GGUF-format build of the e5-base-v2 embedding model for tasks such as sentence similarity, with a maximum context of 512 tokens.
Text Embedding English
ChristianAzinn
168
2
Polka 1.1b Chat
MIT
The first Polish dialogue assistant model designed specifically for local deployment, based on TinyLlama-1.1B with an extended Polish tokenizer and trained with DPO.
Large Language Model Transformers Other
eryk-mazus
91
19
GPT NeoX 1.3B Viet Final GGUF
A 1.3B-parameter GPT-NeoX model pretrained on 31.3 GB of Vietnamese data.
Large Language Model English
afrideva
170
1